Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 11760 |
| Missing cells | 1377 |
| Missing cells (%) | 0.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.5 MiB |
| Average record size in memory | 136.0 B |
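The percentage and size figures in this table follow from the raw counts. The sketch below re-derives them as a sanity check. It assumes the missing-cell percentage is taken over all cells (variables × observations) and that the total size is the record count times the average record size; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 17
n_observations = 11760
missing_cells = 1377
avg_record_bytes = 136.0

# Assumption: percentages are computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total memory is records x average record size, in MiB (2**20 bytes).
total_mib = n_observations * avg_record_bytes / 2**20

print(f"total cells:    {total_cells}")
print(f"missing cells:  {missing_pct:.1f}%")
print(f"size in memory: {total_mib:.1f} MiB")
```

Both derived values round to the figures reported in the table (0.7% and 1.5 MiB).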
Variable types
| Numeric | 8 |
|---|---|
| Boolean | 3 |
| Categorical | 6 |
UserID is highly correlated with montly_avg_comment_on_company_page | High correlation |
Yearly_avg_view_on_travel_page is highly correlated with Daily_Avg_mins_spend_on_traveling_page | High correlation |
total_likes_on_outofstation_checkin_received is highly correlated with Daily_Avg_mins_spend_on_traveling_page | High correlation |
montly_avg_comment_on_company_page is highly correlated with UserID | High correlation |
Daily_Avg_mins_spend_on_traveling_page is highly correlated with Yearly_avg_view_on_travel_page and 1 other fields | High correlation |
Yearly_avg_view_on_travel_page is highly correlated with total_likes_on_outofstation_checkin_received and 1 other fields | High correlation |
total_likes_on_outofstation_checkin_received is highly correlated with Yearly_avg_view_on_travel_page and 1 other fields | High correlation |
Daily_Avg_mins_spend_on_traveling_page is highly correlated with Yearly_avg_view_on_travel_page and 1 other fields | High correlation |
UserID is highly correlated with yearly_avg_Outstation_checkins and 1 other fields | High correlation |
yearly_avg_Outstation_checkins is highly correlated with UserID and 1 other fields | High correlation |
preferred_location_type is highly correlated with UserID and 1 other fields | High correlation |
Yearly_avg_view_on_travel_page has 581 (4.9%) missing values | Missing |
total_likes_on_outstation_checkin_given has 381 (3.2%) missing values | Missing |
Yearly_avg_comment_on_travel_page has 206 (1.8%) missing values | Missing |
UserID is uniformly distributed | Uniform |
UserID has unique values | Unique |
week_since_last_outstation_checkin has 1032 (8.8%) zeros | Zeros |
Reproduction
| Analysis started | 2022-04-24 11:48:43.090241 |
|---|---|
| Analysis finished | 2022-04-24 11:49:00.279074 |
| Duration | 17.19 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
UserID
Numeric
| Distinct | 11760 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1005880.5 |
| Minimum | 1000001 |
|---|---|
| Maximum | 1011760 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 92.0 KiB |
Quantile statistics
| Minimum | 1000001 |
|---|---|
| 5-th percentile | 1000588.95 |
| Q1 | 1002940.75 |
| median | 1005880.5 |
| Q3 | 1008820.25 |
| 95-th percentile | 1011172.05 |
| Maximum | 1011760 |
| Range | 11759 |
| Interquartile range (IQR) | 5879.5 |
Descriptive statistics
| Standard deviation | 3394.963917 |
|---|---|
| Coefficient of variation (CV) | 0.003375116544 |
| Kurtosis | -1.2 |
| Mean | 1005880.5 |
| Median Absolute Deviation (MAD) | 2940 |
| Skewness | 0 |
| Sum | 1.182915468 × 10^10 |
| Variance | 11525780 |
| Monotonicity | Strictly increasing |
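These figures are what a run of consecutive integers produces. The sketch below assumes the IDs are exactly 1000001 through 1011760 with no gaps (consistent with 11760 distinct values, a range of 11759, and strict monotonicity) and recomputes the mean, variance, and standard deviation with Python's standard library. The closed form n(n+1)/12 is the sample variance of n consecutive integers.

```python
import math
import statistics

# Assumption: the IDs are consecutive integers with no gaps.
ids = range(1000001, 1011760 + 1)
n = len(ids)

mean = statistics.fmean(ids)
variance = statistics.variance(ids)   # sample variance (divides by n - 1)
std = math.sqrt(variance)

# Closed form for the sample variance of n consecutive integers.
closed_form = n * (n + 1) / 12

print(n, mean, variance, closed_form, round(std, 6))
```

Under that assumption the "Uniform" and "Unique" alerts raised for UserID are expected: the column behaves like a row identifier rather than a measured quantity.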
| Value | Count | Frequency (%) |
| 1001473 | 1 | < 0.1% |
| 1006302 | 1 | < 0.1% |
| 1010400 | 1 | < 0.1% |
| 1008353 | 1 | < 0.1% |
| 1002212 | 1 | < 0.1% |
| 1000165 | 1 | < 0.1% |
| 1006310 | 1 | < 0.1% |
| 1004263 | 1 | < 0.1% |
| 1010408 | 1 | < 0.1% |
| 1008361 | 1 | < 0.1% |
| Other values (11750) | 11750 |
| Value | Count | Frequency (%) |
| 1000001 | 1 | |
| 1000002 | 1 | |
| 1000003 | 1 | |
| 1000004 | 1 | |
| 1000005 | 1 | |
| 1000006 | 1 | |
| 1000007 | 1 | |
| 1000008 | 1 | |
| 1000009 | 1 | |
| 1000010 | 1 |
| Value | Count | Frequency (%) |
| 1011760 | 1 | |
| 1011759 | 1 | |
| 1011758 | 1 | |
| 1011757 | 1 | |
| 1011756 | 1 | |
| 1011755 | 1 | |
| 1011754 | 1 | |
| 1011753 | 1 | |
| 1011752 | 1 | |
| 1011751 | 1 |
Buy_ticket
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.6 KiB |
| Value | Count | Frequency (%) |
| False | 9864 | 83.9% |
| True | 1896 | 16.1% |
Yearly_avg_view_on_travel_page
Numeric
| Distinct | 331 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 581 |
| Missing (%) | 4.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 280.8308435 |
| Minimum | 35 |
|---|---|
| Maximum | 464 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 92.0 KiB |
Quantile statistics
| Minimum | 35 |
|---|---|
| 5-th percentile | 182 |
| Q1 | 232 |
| median | 271 |
| Q3 | 324 |
| 95-th percentile | 411 |
| Maximum | 464 |
| Range | 429 |
| Interquartile range (IQR) | 92 |
Descriptive statistics
| Standard deviation | 68.18295849 |
|---|---|
| Coefficient of variation (CV) | 0.2427901352 |
| Kurtosis | -0.2870008263 |
| Mean | 280.8308435 |
| Median Absolute Deviation (MAD) | 45 |
| Skewness | 0.4144086403 |
| Sum | 3139408 |
| Variance | 4648.915828 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 262 | 190 | 1.6% |
| 255 | 186 | 1.6% |
| 270 | 179 | 1.5% |
| 217 | 165 | 1.4% |
| 232 | 160 | 1.4% |
| 225 | 148 | 1.3% |
| 240 | 142 | 1.2% |
| 247 | 139 | 1.2% |
| 285 | 136 | 1.2% |
| 277 | 133 | 1.1% |
| Other values (321) | 9601 | |
| (Missing) | 581 | 4.9% |
| Value | Count | Frequency (%) |
| 35 | 4 | |
| 42 | 5 | |
| 135 | 3 | < 0.1% |
| 136 | 9 | |
| 137 | 7 | |
| 138 | 3 | < 0.1% |
| 140 | 2 | < 0.1% |
| 141 | 3 | < 0.1% |
| 142 | 4 | |
| 143 | 7 |
| Value | Count | Frequency (%) |
| 464 | 1 | < 0.1% |
| 463 | 1 | < 0.1% |
| 462 | 2 | < 0.1% |
| 461 | 2 | < 0.1% |
| 460 | 3 | |
| 459 | 2 | < 0.1% |
| 458 | 1 | < 0.1% |
| 457 | 3 | |
| 456 | 5 | |
| 455 | 7 |
preferred_device
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 92.0 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mobile |
|---|---|
| 2nd row | Mobile |
| 3rd row | Mobile |
| 4th row | Mobile |
| 5th row | Mobile |
Common Values
| Value | Count | Frequency (%) |
| Mobile | 10652 | 90.6% |
| Laptop | 1108 | 9.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| mobile | 10652 | |
| laptop | 1108 | 9.4% |
total_likes_on_outstation_checkin_given
Numeric
| Distinct | 7888 |
|---|---|
| Distinct (%) | 69.3% |
| Missing | 381 |
| Missing (%) | 3.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28170.48176 |
| Minimum | 3570 |
|---|---|
| Maximum | 252430 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 92.0 KiB |
Quantile statistics
| Minimum | 3570 |
|---|---|
| 5-th percentile | 5854 |
| Q1 | 16380 |
| median | 28076 |
| Q3 | 40525 |
| 95-th percentile | 49945.5 |
| Maximum | 252430 |
| Range | 248860 |
| Interquartile range (IQR) | 24145 |
Descriptive statistics
| Standard deviation | 14385.03213 |
|---|---|
| Coefficient of variation (CV) | 0.510642035 |
| Kurtosis | 5.320921747 |
| Mean | 28170.48176 |
| Median Absolute Deviation (MAD) | 12034 |
| Skewness | 0.4896375725 |
| Sum | 320551912 |
| Variance | 206929149.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24185 | 12 | 0.1% |
| 11515 | 11 | 0.1% |
| 37870 | 10 | 0.1% |
| 18550 | 10 | 0.1% |
| 34195 | 9 | 0.1% |
| 5145 | 9 | 0.1% |
| 29015 | 9 | 0.1% |
| 7595 | 8 | 0.1% |
| 33250 | 8 | 0.1% |
| 44905 | 8 | 0.1% |
| Other values (7878) | 11285 | |
| (Missing) | 381 | 3.2% |
| Value | Count | Frequency (%) |
| 3570 | 2 | |
| 3577 | 1 | |
| 3578 | 1 | |
| 3605 | 2 | |
| 3611 | 1 | |
| 3614 | 1 | |
| 3618 | 1 | |
| 3620 | 1 | |
| 3621 | 1 | |
| 3631 | 1 |
| Value | Count | Frequency (%) |
| 252430 | 1 | |
| 152465 | 2 | |
| 152430 | 1 | |
| 52512 | 1 | |
| 52509 | 1 | |
| 52498 | 1 | |
| 52495 | 1 | |
| 52487 | 1 | |
| 52479 | 1 | |
| 52474 | 1 |
yearly_avg_Outstation_checkins
Categorical
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 75 |
| Missing (%) | 0.6% |
| Memory size | 92.0 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.360462131 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 4543 | 38.6% |
| 2 | 844 | 7.2% |
| 10 | 682 | 5.8% |
| 9 | 340 | 2.9% |
| 7 | 336 | 2.9% |
| 3 | 336 | 2.9% |
| 8 | 320 | 2.7% |
| 5 | 261 | 2.2% |
| 4 | 256 | 2.2% |
| 16 | 255 | 2.2% |
| Other values (20) | 3512 | 29.9% |
Length
| Value | Count | Frequency (%) |
| 1 | 4543 | |
| 2 | 844 | 7.2% |
| 10 | 682 | 5.8% |
| 9 | 340 | 2.9% |
| 7 | 336 | 2.9% |
| 3 | 336 | 2.9% |
| 8 | 320 | 2.7% |
| 5 | 261 | 2.2% |
| 4 | 256 | 2.2% |
| 16 | 255 | 2.2% |
| Other values (20) | 3512 |
member_in_family
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 92.0 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.006037415 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 4 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 4561 | 38.8% |
| 4 | 3184 | 27.1% |
| 2 | 2256 | 19.2% |
| 1 | 1349 | 11.5% |
| 5 | 384 | 3.3% |
| Three | 15 | 0.1% |
| 10 | 11 | 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 3 | 4561 | |
| 4 | 3184 | |
| 2 | 2256 | |
| 1 | 1349 | 11.5% |
| 5 | 384 | 3.3% |
| three | 15 | 0.1% |
| 10 | 11 | 0.1% |
preferred_location_type
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 31 |
| Missing (%) | 0.3% |
| Memory size | 92.0 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 8.681302754 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Financial |
|---|---|
| 2nd row | Financial |
| 3rd row | Other |
| 4th row | Financial |
| 5th row | Medical |
Common Values
| Value | Count | Frequency (%) |
| Beach | 2424 | 20.6% |
| Financial | 2409 | 20.5% |
| Historical site | 1856 | 15.8% |
| Medical | 1845 | 15.7% |
| Other | 1386 | 11.8% |
| Entertainment | 1173 | 10.0% |
| Trekking | 636 | 5.4% |
| (Missing) | 31 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| beach | 2424 | |
| financial | 2409 | |
| historical | 1856 | |
| site | 1856 | |
| medical | 1845 | |
| other | 1386 | |
| entertainment | 1173 | |
| trekking | 636 | 4.7% |
Yearly_avg_comment_on_travel_page
Numeric
| Distinct | 100 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 206 |
| Missing (%) | 1.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.79002943 |
| Minimum | 3 |
|---|---|
| Maximum | 815 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 92.0 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 41 |
| Q1 | 57 |
| median | 75 |
| Q3 | 92 |
| 95-th percentile | 108 |
| Maximum | 815 |
| Range | 812 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 24.02664966 |
|---|---|
| Coefficient of variation (CV) | 0.3212547159 |
| Kurtosis | 134.7851038 |
| Mean | 74.79002943 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | 4.868224985 |
| Sum | 864124 |
| Variance | 577.2798937 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 96 | 192 | 1.6% |
| 66 | 191 | 1.6% |
| 90 | 190 | 1.6% |
| 56 | 188 | 1.6% |
| 80 | 184 | 1.6% |
| 72 | 183 | 1.6% |
| 95 | 180 | 1.5% |
| 92 | 179 | 1.5% |
| 88 | 177 | 1.5% |
| 79 | 176 | 1.5% |
| Other values (90) | 9714 | |
| (Missing) | 206 | 1.8% |
| Value | Count | Frequency (%) |
| 3 | 36 | |
| 31 | 29 | |
| 32 | 47 | |
| 33 | 38 | |
| 34 | 35 | |
| 35 | 46 | |
| 36 | 56 | |
| 37 | 54 | |
| 38 | 64 | |
| 39 | 65 |
| Value | Count | Frequency (%) |
| 815 | 1 | < 0.1% |
| 685 | 1 | < 0.1% |
| 615 | 1 | < 0.1% |
| 215 | 1 | < 0.1% |
| 125 | 7 | |
| 124 | 3 | < 0.1% |
| 123 | 8 | |
| 122 | 10 | |
| 121 | 11 | |
| 120 | 10 |
total_likes_on_outofstation_checkin_received
Numeric
| Distinct | 6288 |
|---|---|
| Distinct (%) | 53.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6531.699065 |
| Minimum | 1009 |
|---|---|
| Maximum | 20065 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 92.0 KiB |
Quantile statistics
| Minimum | 1009 |
|---|---|
| 5-th percentile | 2132 |
| Q1 | 2940.75 |
| median | 4948 |
| Q3 | 8393.25 |
| 95-th percentile | 17861 |
| Maximum | 20065 |
| Range | 19056 |
| Interquartile range (IQR) | 5452.5 |
Descriptive statistics
| Standard deviation | 4706.613785 |
|---|---|
| Coefficient of variation (CV) | 0.7205803174 |
| Kurtosis | 0.9987327559 |
| Mean | 6531.699065 |
| Median Absolute Deviation (MAD) | 2195 |
| Skewness | 1.368578368 |
| Sum | 76812781 |
| Variance | 22152213.32 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2377 | 12 | 0.1% |
| 2342 | 11 | 0.1% |
| 2380 | 11 | 0.1% |
| 2610 | 10 | 0.1% |
| 2096 | 10 | 0.1% |
| 2570 | 9 | 0.1% |
| 2387 | 9 | 0.1% |
| 3452 | 9 | 0.1% |
| 2404 | 9 | 0.1% |
| 2437 | 9 | 0.1% |
| Other values (6278) | 11661 |
| Value | Count | Frequency (%) |
| 1009 | 2 | |
| 1014 | 1 | |
| 1017 | 1 | |
| 1050 | 1 | |
| 1051 | 2 | |
| 1052 | 2 | |
| 1055 | 1 | |
| 1058 | 1 | |
| 1060 | 1 | |
| 1061 | 2 |
| Value | Count | Frequency (%) |
| 20065 | 1 | |
| 20059 | 1 | |
| 20056 | 1 | |
| 20049 | 1 | |
| 20038 | 1 | |
| 20036 | 1 | |
| 20032 | 1 | |
| 20030 | 1 | |
| 20008 | 1 | |
| 20004 | 1 |
week_since_last_outstation_checkin
Numeric
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.203571429 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 1032 |
| Zeros (%) | 8.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 92.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 9 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.616364893 |
|---|---|
| Coefficient of variation (CV) | 0.8167025306 |
| Kurtosis | -0.03827306777 |
| Mean | 3.203571429 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.9153335743 |
| Sum | 37674 |
| Variance | 6.845365252 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3070 | |
| 3 | 1766 | |
| 2 | 1700 | |
| 4 | 1118 | 9.5% |
| 0 | 1032 | 8.8% |
| 5 | 728 | 6.2% |
| 6 | 654 | 5.6% |
| 7 | 594 | 5.1% |
| 9 | 472 | 4.0% |
| 8 | 428 | 3.6% |
| Other values (2) | 198 | 1.7% |
| Value | Count | Frequency (%) |
| 0 | 1032 | 8.8% |
| 1 | 3070 | |
| 2 | 1700 | |
| 3 | 1766 | |
| 4 | 1118 | 9.5% |
| 5 | 728 | 6.2% |
| 6 | 654 | 5.6% |
| 7 | 594 | 5.1% |
| 8 | 428 | 3.6% |
| 9 | 472 | 4.0% |
| Value | Count | Frequency (%) |
| 11 | 60 | 0.5% |
| 10 | 138 | 1.2% |
| 9 | 472 | 4.0% |
| 8 | 428 | 3.6% |
| 7 | 594 | 5.1% |
| 6 | 654 | 5.6% |
| 5 | 728 | |
| 4 | 1118 | |
| 3 | 1766 | |
| 2 | 1700 |
following_company_page
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 103 |
| Missing (%) | 0.9% |
| Memory size | 23.1 KiB |
| Value | Count | Frequency (%) |
| False | 8360 | 71.1% |
| True | 3297 | 28.0% |
| (Missing) | 103 | 0.9% |
montly_avg_comment_on_company_page
Numeric
| Distinct | 160 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.66156463 |
| Minimum | 11 |
|---|---|
| Maximum | 500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 92.0 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 17 |
| median | 22 |
| Q3 | 27 |
| 95-th percentile | 36 |
| Maximum | 500 |
| Range | 489 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 48.66050382 |
|---|---|
| Coefficient of variation (CV) | 1.6977616 |
| Kurtosis | 59.66269923 |
| Mean | 28.66156463 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 7.684149905 |
| Sum | 337060 |
| Variance | 2367.844632 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 673 | 5.7% |
| 22 | 653 | 5.6% |
| 25 | 609 | 5.2% |
| 24 | 605 | 5.1% |
| 21 | 594 | 5.1% |
| 20 | 588 | 5.0% |
| 19 | 574 | 4.9% |
| 18 | 573 | 4.9% |
| 17 | 524 | 4.5% |
| 26 | 505 | 4.3% |
| Other values (150) | 5862 |
| Value | Count | Frequency (%) |
| 11 | 420 | |
| 12 | 396 | |
| 13 | 418 | |
| 14 | 480 | |
| 15 | 366 | |
| 16 | 408 | |
| 17 | 524 | |
| 18 | 573 | |
| 19 | 574 | |
| 20 | 588 |
| Value | Count | Frequency (%) |
| 500 | 1 | |
| 499 | 1 | |
| 497 | 1 | |
| 491 | 1 | |
| 490 | 2 | |
| 488 | 1 | |
| 487 | 1 | |
| 486 | 1 | |
| 485 | 1 | |
| 484 | 2 |
working_flag
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.6 KiB |
| Value | Count | Frequency (%) |
| False | 9952 | 84.6% |
| True | 1808 | 15.4% |
travelling_network_rating
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 92.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 4 |
| 3rd row | 2 |
| 4th row | 3 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 3672 | 31.2% |
| 4 | 3456 | 29.4% |
| 2 | 2424 | 20.6% |
| 1 | 2208 | 18.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 3 | 3672 | |
| 4 | 3456 | |
| 2 | 2424 | |
| 1 | 2208 |
number_of_adults
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 92.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5048 | 42.9% |
| 1 | 4768 | 40.5% |
| 2 | 1264 | 10.7% |
| 3 | 680 | 5.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 5048 | |
| 1 | 4768 | |
| 2 | 1264 | 10.7% |
| 3 | 680 | 5.8% |
Daily_Avg_mins_spend_on_traveling_page
Numeric
| Distinct | 52 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.81743197 |
| Minimum | 0 |
|---|---|
| Maximum | 270 |
| Zeros | 46 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 92.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 8 |
| median | 12 |
| Q3 | 18 |
| 95-th percentile | 31 |
| Maximum | 270 |
| Range | 270 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 9.070656619 |
|---|---|
| Coefficient of variation (CV) | 0.6564647206 |
| Kurtosis | 93.94396127 |
| Mean | 13.81743197 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 4.480682458 |
| Sum | 162493 |
| Variance | 82.2768115 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 1126 | 9.6% |
| 9 | 676 | 5.7% |
| 8 | 662 | 5.6% |
| 6 | 624 | 5.3% |
| 7 | 554 | 4.7% |
| 13 | 532 | 4.5% |
| 11 | 530 | 4.5% |
| 12 | 500 | 4.3% |
| 14 | 496 | 4.2% |
| 15 | 480 | 4.1% |
| Other values (42) | 5580 |
| Value | Count | Frequency (%) |
| 0 | 46 | 0.4% |
| 1 | 336 | |
| 2 | 146 | 1.2% |
| 3 | 218 | 1.9% |
| 4 | 330 | |
| 5 | 444 | |
| 6 | 624 | |
| 7 | 554 | |
| 8 | 662 | |
| 9 | 676 |
| Value | Count | Frequency (%) |
| 270 | 1 | < 0.1% |
| 235 | 1 | < 0.1% |
| 170 | 1 | < 0.1% |
| 135 | 1 | < 0.1% |
| 47 | 1 | < 0.1% |
| 46 | 3 | < 0.1% |
| 45 | 4 | |
| 44 | 8 | |
| 43 | 4 | |
| 42 | 6 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
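As a rough illustration of that recipe, the sketch below ranks each variable (tied values share the mean of their positions) and then applies the covariance-over-standard-deviations formula to the ranks. It is a minimal standard-library version written for this note, not the code the profiling tool runs; the function names and toy data are assumptions.

```python
from math import sqrt
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy data: y = x**2 is monotonic in x but not linear.
x = [1, 2, 3, 4, 5]
y = [1, 4, 9, 16, 25]
rho = spearman(x, y)
r = pearson(x, y)
print(rho, r)
```

On this toy data ρ is 1 up to floating-point rounding, while r falls short of 1 because the relationship is monotonic but not linear.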
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
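The definition and the invariance property can be checked on a small example. The sketch below is a minimal standard-library version (the function name and toy data are assumptions, not part of the report): it computes r as covariance over the product of standard deviations, then recomputes it after shifting and rescaling both variables by positive factors.

```python
from math import sqrt


def pearson_r(x, y):
    """Sample covariance of x and y divided by the product of their sample SDs."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (sd_x * sd_y)


x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 1.0, 4.0, 3.0, 5.0]
r = pearson_r(x, y)

# Shift and rescale each variable separately (positive scale factors).
x_rescaled = [10 * a + 3 for a in x]
y_rescaled = [0.5 * b - 7 for b in y]
r_rescaled = pearson_r(x_rescaled, y_rescaled)

print(r, r_rescaled)
```

The two values agree up to floating-point rounding. A negative scale factor on one variable would flip the sign of r but leave its magnitude unchanged.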
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
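The pair-counting definition translates directly into code. The sketch below implements that plain form (often called τ-a), with a hypothetical function name and toy data. Library implementations commonly use tie-adjusted variants, so results can differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs; ties count as neither."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both variables move the same way
        elif direction < 0:
            discordant += 1      # the variables move in opposite ways
    n = len(x)
    total_pairs = n * (n - 1) / 2
    return (concordant - discordant) / total_pairs


x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
tau = kendall_tau_a(x, y)
print(tau)
```

For this toy data, 8 of the 10 pairs are concordant and 2 are discordant, giving τ = 0.6.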
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | UserID | Buy_ticket | Yearly_avg_view_on_travel_page | preferred_device | total_likes_on_outstation_checkin_given | yearly_avg_Outstation_checkins | member_in_family | preferred_location_type | Yearly_avg_comment_on_travel_page | total_likes_on_outofstation_checkin_received | week_since_last_outstation_checkin | following_company_page | montly_avg_comment_on_company_page | working_flag | travelling_network_rating | number_of_adults | Daily_Avg_mins_spend_on_traveling_page |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1000001 | Yes | 307.0 | Mobile | 38570.0 | 1 | 2 | Financial | 94.0 | 5993 | 8 | Yes | 11 | No | 1 | 0 | 8 |
| 1 | 1000002 | No | 367.0 | Mobile | 9765.0 | 1 | 1 | Financial | 61.0 | 5130 | 1 | No | 23 | Yes | 4 | 1 | 10 |
| 2 | 1000003 | Yes | 277.0 | Mobile | 48055.0 | 1 | 2 | Other | 92.0 | 2090 | 6 | Yes | 15 | No | 2 | 0 | 7 |
| 3 | 1000004 | No | 247.0 | Mobile | 48720.0 | 1 | 4 | Financial | 56.0 | 2909 | 1 | Yes | 11 | No | 3 | 0 | 8 |
| 4 | 1000005 | No | 202.0 | Mobile | 20685.0 | 1 | 1 | Medical | 40.0 | 3468 | 9 | No | 12 | No | 4 | 1 | 6 |
| 5 | 1000006 | No | 240.0 | Mobile | 35175.0 | 1 | 2 | Financial | 79.0 | 3068 | 0 | No | 13 | No | 3 | 0 | 8 |
| 6 | 1000007 | No | NaN | Mobile | 46340.0 | 1 | Three | Medical | 81.0 | 2670 | 4 | Yes | 20 | Yes | 1 | 3 | 12 |
| 7 | 1000008 | No | 225.0 | Mobile | NaN | 24 | 1 | Financial | 67.0 | 2693 | 1 | No | 22 | Yes | 2 | 1 | 1 |
| 8 | 1000009 | No | 285.0 | Mobile | 7560.0 | 23 | 3 | Financial | 44.0 | 9526 | 0 | No | 21 | Yes | 2 | 0 | 10 |
| 9 | 1000010 | No | 270.0 | Mobile | 45465.0 | 27 | 3 | NaN | 94.0 | 5237 | 6 | No | 13 | No | 2 | 2 | 17 |
Last rows
| | UserID | Buy_ticket | Yearly_avg_view_on_travel_page | preferred_device | total_likes_on_outstation_checkin_given | yearly_avg_Outstation_checkins | member_in_family | preferred_location_type | Yearly_avg_comment_on_travel_page | total_likes_on_outofstation_checkin_received | week_since_last_outstation_checkin | following_company_page | montly_avg_comment_on_company_page | working_flag | travelling_network_rating | number_of_adults | Daily_Avg_mins_spend_on_traveling_page |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 11750 | 1011751 | No | 231.0 | Mobile | 16423.0 | 28 | 4 | Historical site | 96.0 | 3845 | 1 | No | 26 | No | 2 | 0 | 12 |
| 11751 | 1011752 | Yes | 383.0 | Mobile | 14399.0 | 28 | 3 | Other | 58.0 | 10910 | 6 | Yes | 28 | No | 2 | 1 | 23 |
| 11752 | 1011753 | No | 302.0 | Mobile | 25317.0 | 24 | 1 | Other | 79.0 | 12093 | 0 | No | 24 | No | 1 | 1 | 29 |
| 11753 | 1011754 | No | 247.0 | Mobile | 11418.0 | 5 | 3 | Historical site | 99.0 | 9983 | 1 | No | 28 | No | 2 | 0 | 16 |
| 11754 | 1011755 | No | 210.0 | Mobile | 40886.0 | 5 | 3 | Other | 53.0 | 3024 | 2 | No | 32 | No | 4 | 0 | 14 |
| 11755 | 1011756 | No | 279.0 | Laptop | 30987.0 | 23 | 2 | Historical site | 58.0 | 2616 | 4 | No | 36 | No | 3 | 1 | 23 |
| 11756 | 1011757 | No | 305.0 | Mobile | 21510.0 | 6 | 1 | Historical site | 55.0 | 10041 | 4 | No | 30 | No | 1 | 1 | 11 |
| 11757 | 1011758 | No | 214.0 | Mobile | 5478.0 | 4 | 3 | Beach | 103.0 | 6203 | 3 | Yes | 40 | Yes | 2 | 1 | 12 |
| 11758 | 1011759 | No | 382.0 | Laptop | 35851.0 | 2 | 3 | Historical site | 83.0 | 5444 | 3 | No | 32 | No | 4 | 0 | 20 |
| 11759 | 1011760 | No | 270.0 | Mobile | 22025.0 | 8 | 3 | Historical site | 104.0 | 4470 | 2 | No | 29 | No | 1 | 0 | 14 |